skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Paulson, Joel A"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Optimal design under uncertainty remains a fundamental challenge in advancing reliable, next-generation process systems. Robust optimization (RO) offers a principled approach by safeguarding against worst-case scenarios across a range of uncertain parameters. However, traditional RO methods typically require known problem structure, which limits their applicability to high-fidelity simulation environments. To overcome these limitations, recent work has explored robust Bayesian optimization (RBO) as a flexible alternative that can accommodate expensive, black-box objectives. Existing RBO methods, however, generally ignore available structural information and struggle to scale to high-dimensional settings. In this work, we introduce BONSAI (Bayesian Optimization of Network Systems under uncertAInty), a new RBO framework that leverages partial structural knowledge commonly available in simulation-based models. Instead of treating the objective as a monolithic black box, BONSAI represents it as a directed graph of interconnected white- and black-box components, allowing the algorithm to utilize intermediate information within the optimization process. We further propose a scalable Thompson sampling-based acquisition function tailored to the structured RO setting, which can be efficiently optimized using gradient-based methods. We evaluate BONSAI across a diverse set of synthetic and real-world case studies, including applications in process systems engineering. Compared to existing simulation-based RO algorithms, BONSAI consistently delivers more sample-efficient and higher-quality robust solutions, highlighting its practical advantages for uncertainty-aware design in complex engineering systems. 
    more » « less
    Free, publicly-accessible full text available January 1, 2027
  2. Designing molecules that must satisfy multiple, often conflicting, objectives is a central challenge in molecular discovery. The enormous size of the chemical space and the cost of high-fidelity simulations have driven the development of machine learning-guided strategies for accelerating design with limited data. Among these, Bayesian optimization (BO) offers a principled framework for sample-efficient search, while generative models provide a mechanism to propose novel, diverse candidates beyond fixed libraries. However, existing methods that couple the two often rely on continuous latent spaces, which introduce both architectural entanglement and scalability challenges. This work introduces an alternative, modular “generate-then-optimize” framework for de novo multiobjective molecular design/discovery. At each iteration, a generative model is used to construct a large, diverse pool of candidate molecules, after which a novel acquisition function, qPMHI (multipoint Probability of Maximum Hypervolume Improvement), is used to optimally select a batch of candidates most likely to induce the largest Pareto front expansion. The key insight is that qPMHI decomposes additively, enabling exact, scalable batch selection via only a simple ranking of probabilities that can be easily estimated with Monte Carlo sampling. We benchmark the framework against state-of-the-art latent-space and discrete molecular optimization methods, demonstrating significant improvements across synthetic benchmarks and application-driven tasks. Specifically, in a case study related to sustainable energy storage, we show that our approach quickly uncovers novel, diverse, and high-performing organic (quinone-based) cathode materials for aqueous redox flow battery applications. 
    more » « less
    Free, publicly-accessible full text available December 21, 2026
  3. Rapid mixing is a critical step in many nanoparticle syntheses that can impact the ability to scale production from bench to industrial levels. This study combines experimental and computational approaches to characterize mixing dynamics in crossflow jet mixing reactors (JMRs) with millimeter-scale internal dimensions. The Villermaux-Dushman reaction system is used to quantify experimental mixing times across different reactor sizes and flow rates. Complementary computational fluid dynamics (CFD) simulations assess changes in the state of the flow and estimate mixing times under varying operating conditions. Mixing times derived from CFD results agree well with the experimental results for mixing indices between 0.95 and 0.98. To demonstrate the impact of mixing on nanoparticle formation, we synthesize polybutylacrylate-b-polyacrylic acid (PBA-PAA) block co-polymer nanoparticles, confirming the existence of a critical flow rate beyond which particle size stabilizes. Additionally, we produce polylactic acid-co-glycolic acid (PLGA) nanoparticles incorporating a hydrophobic dye, achieving an average particle size below 300 nm at a throughput of ∼ 1.3 kg/day. These results provide insights into optimizing JMRs for high-throughput, reproducible nanoparticle synthesis, bridging the gap between benchtop and industrial-scale production. 
    more » « less
    Free, publicly-accessible full text available July 15, 2026
  4. Free, publicly-accessible full text available June 1, 2026
  5. Free, publicly-accessible full text available February 1, 2026
  6. The discovery of molecules with optimal functional properties is a central challenge across diverse fields such as energy storage, catalysis, and chemical sensing. However, molecular property optimization (MPO) remains difficult due to the combinatorial size of chemical space and the cost of acquiring property labels via simulations or wet-lab experiments. Bayesian optimization (BO) offers a principled framework for sample-efficient discovery in such settings, but its effectiveness depends critically on the quality of the molecular representation used to train the underlying probabilistic surrogate model. Existing approaches based on fingerprints, graphs, SMILES strings, or learned embeddings often struggle in low-data regimes due to high dimensionality or poorly structured latent spaces. Here, we introduce Molecular Descriptors with Actively Identified Subspaces (MolDAIS), a flexible molecular BO framework that adaptively identifies task-relevant subspaces within large descriptor libraries. Leveraging the sparse axis-aligned subspace (SAAS) prior introduced in recent BO literature, MolDAIS constructs parsimonious Gaussian process surrogate models that focus on task-relevant features as new data is acquired. In addition to validating this approach for descriptor-based MPO, we introduce two novel screening variants, which significantly reduce computational cost while preserving predictive accuracy and physical interpretability. We demonstrate that MolDAIS consistently outperforms state-of-the-art MPO methods across a suite of benchmark and real-world tasks, including single- and multi-objective optimization. Our results show that MolDAIS can identify near-optimal candidates from chemical libraries with over 100,000 molecules using fewer than 100 property evaluations, highlighting its promise as a practical tool for data-scarce molecular discovery. 
    more » « less
    Free, publicly-accessible full text available October 8, 2026
  7. Rapid mixing is a critical step in many nanoparticle syntheses that can impact the ability to scale production from bench to industrial levels. This study combines experimental and computational approaches to characterize mixing dynamics in crossflow jet mixing reactors (JMRs) with millimeter-scale internal dimensions. The Villermaux-Dushman reaction system is used to quantify experimental mixing times across different reactor sizes and flow rates. Complementary computational fluid dynamics (CFD) simulations assess changes in the state of the flow and estimate mixing times under varying operating conditions. Mixing times derived from CFD results agree well with the experimental results for mixing indices between 0.95 and 0.98. To demonstrate the impact of mixing on nanoparticle formation, we synthesize polybutylacrylate-b-polyacrylic acid (PBA-PAA) block co-polymer nanoparticles, confirming the existence of a critical flow rate beyond which particle size stabilizes. Additionally, we produce polylactic acid-co-glycolic acid (PLGA) nanoparticles incorporating a hydrophobic dye, achieving an average particle size below 300 nm at a throughput of ~1.3 kg/day. These results provide insights into optimizing JMRs for high-throughput, reproducible nanoparticle synthesis, bridging the gap between benchtop and industrial-scale production. 
    more » « less
    Free, publicly-accessible full text available July 1, 2026
  8. Symbolic regression (SR) is an emerging branch of machine learning focused on discovering simple and interpretable mathematical expressions from data. Although a wide-variety of SR methods have been developed, they often face challenges such as high computational cost, poor scalability with respect to the number of input dimensions, fragility to noise, and an inability to balance accuracy and complexity. This work introduces SyMANTIC, a novel SR algorithm that addresses these challenges. SyMANTIC efficiently identifies (potentially several) low-dimensional descriptors from a large set of candidates (from ∼105 to ∼1010 or more) through a unique combination of mutual information-based feature selection, adaptive feature expansion, and recursively applied l 0 -based sparse regression. In addition, it employs an information-theoretic measure to produce an approximate set of Pareto-optimal equations, each offering the best-found accuracy for a given complexity. Furthermore, our open-source implementation of SyMANTIC, built on the PyTorch ecosystem, facilitates easy installation and GPU acceleration. We demonstrate the effectiveness of SyMANTIC across a range of problems, including synthetic examples, scientific benchmarks, real-world material property predictions, and chaotic dynamical system identification from small datasets. Extensive comparisons show that SyMANTIC uncovers similar or more accurate models at a fraction of the cost of existing SR methods. 
    more » « less
    Free, publicly-accessible full text available February 12, 2026